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ABSTRACT 


Recommending courses to students is a fundamental and also 
challenging issue in the traditional university environment. Not 
exactly like course recommendation in MOOCs, the selection and 
recommendation for higher education is a non-trivial task as it 
depends on many factors that students need to consider. Although 
many studies on this topic have been proposed, most of them only 
focus either on historical course enrollment data or on models of 
predicting course outcomes to give recommendation results, 
regardless of multiple reasons behind course selection behavior. To 
address such a challenge, we first conduct a survey to show the 
underlying characteristic of the course selection of university 
students. According to the survey results, we propose a hybrid 
course recommendation framework based on multiple features. Our 
experimental result illustrates that our method outperforms other 
approaches. Also, our framework is easier to interpret, scrutinize, 
and explain than conventional black-box methods for course 
recommendation. 
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1. INTRODUCTION 


Course selection in university is a crucial and challenging problem 
that students have to face. It is difficult to decide which courses 
they should take because there are a large number of courses 
opened each semester and students have to spend a lot of time 
exploring those courses. Moreover, the decisions they make shape 
their future in ways they may not be able to conceive in advance. 


We collected a dataset during 2015 and 2018 from our university to 
gain a better understanding of the elective course enrollment 
patterns. Figure 1(a) presents the distribution of the enrolled course 
number of students on the left and the distribution of the popularity 
for each course of our university on the right. There are hundreds 
of elective courses offered by the university while averagely 
students only select a few of them to satisfy the requirements for 
their degree program. Figure 1(b) shows the distribution of the 
enrolled courses for each semester. We can also see that students 
may take courses in the first two years mostly 
(semester1~semester4), because they may potentially be busy with 
an internship or finding jobs in the third and fourth year. 
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(b) #Course distribution for each semester. 
Figure 1. Distribution of courses. 


From the discussion above, a safe conclusion could be drawn that 
due to a large number of available but unfamiliar courses, course 
selection is a critical activity for students. 


With the increasing amount of available data about undergraduate 
students and their enrollment information, data-driven methods 
supporting decision making have gained importance to empower 
student choices and scale advice to large cohorts [14]. Many 
relevant studies on course recommendation focus on online 
learning platforms such as MOOCs. Other studies on course 
recommendation use datasets collected in physical university 
environments, however, they rely on approaches that are similar to 
the ones used in recommending MOOC courses without fully 
considering the different reasons involved in course selection 
process in physically-based university environments. 


In fact, course recommendation for higher education can be more 
“messy and unorganized” [1] as it depends on many factors that 
students need to concern. Intuitively, the reasons behind course 
selection are manifold. Likewise, students who enrolled in the same 
course may have completely different orientations based on their 
own reasons, which serves as different criteria for course selection 
[34]. It inspires us to try to find more useful features for the 
recommendation. 


To make the point clear, a survey is conducted on 81 students in 
our university to better understand student perceptions and attitudes 
for their course selection process. [10] Figure 2 shows the main 
underlying reasons for their course selection. 


Proceedings of The 13th International Conference on Educational Data Mining (EDM 2020) 460 


Interested 
High-grade 
Usefulness 
Social factor 
Others 


i 


Figure 2. The distribution of main reasons for course selection. 
(1) Interest 


As Figure 2 shows, the overall most important factor was students’ 
interest and it is often taken as a main contributing factor to the 
recommendation. However, students may not choose courses based 
purely on their interest in the university environment. It is expected 
that students will be more inclined to choose courses that do not 
require too much effort or difficulty. For example, some students 
would not enroll in a course which contains contents they are 
interested in, they just choose the course that allows them to get 
credits easily. 


(2) High-Grade 


Improperly selecting courses would seriously affect the students’ 
course achievements, which enforces students to drop out [12]. 
Getting relatively high grades for students is another factor 
influences student’s choice especially for successful students. 
Some students even prefer to choose what they perceived would be 
an easier course for fear that a tougher course might lower their 
GPA. 


(3) Learning goal and career plan 


It is natural to recommend courses that align with student’s learning 
goals and career plans as students consider the usefulness of 
courses as an important factor in their course selection process. 
However, first-year students may lack learning goals and career 
planning for the future, and the choice of courses is aimless. Also, 
student interest and goal can change as they explore and discover 
something meaningful on and off campus. 


(4) Social Aspect 


Social factor also plays a part in the course selection process. For 
example, some students prefer to enroll in a course with their 
friends or classmates together. Potts et.al [21] conclude that the risk 
of social isolation is a problem in the learning process especially 
for first-year students at university, who have difficulty navigating 
their new academic and environment. Tinto [22] concludes that 
participation in a collaborative learning group encourages student’s 
attendance and class participation. Therefore, the classmates or 
friends based social links could be important information in course 
recommendation. 


(5) Popularity 


As shown in Figure l(a), the long-tail distribution of course 
popularity indicates that students are more motivated to choose 
popular courses as their first choice. However, the popular courses 
will be filled up quickly while others will not be selected by 
students frequently. 


In summary, all these discussions above indicate that there are 
complex constraints and contexts that have to be considered 
together to balance all those factors above, made more difficult by 
the multiple objectives that students want to maximize and risks 
they want to hedge against. For example, choosing challenging 
courses of value while maintaining a high GPA [16]. This suggests 


that recommendations that are aimed only at one or a few factors 
are likely not enough to help the students. 


To address these challenges which have not been well explored in 
the research community, we propose our hybrid course 
recommendation framework, which incorporates different criteria 
in a modular way. Moreover, in our approach selection criteria can 
further be prioritized by the student. We believe that weaving those 
criteria could increase the usability of our recommendations 
compared to previous work focusing only on one of the two. Also, 
our framework is very efficient and easy to interpret. 


2. RELATED WORK 


2.1 Course selection 

Some work has been done on analyzing the college students’ course 
selection. Morsy and Karypis [23] investigated how the student’s 
academic level when they take different courses, relate to their 
graduation GPA and time to degree. This study suggests that 
course recommendation approaches could use this information to 
better assist students towards academic success, by graduating on- 
time with high GPA. Also, understanding students’ reasons for 
enrolling in a course provides key information for recommending 
courses and improving students’ learning experiences [24-27]. 


Additionally, there is still a lack of study on the factors that 
influence students’ course selection in university and how the 
course selection would impact the students’ educational 
achievement. 


2.2 Personalized Course Recommendation 
Various approaches have been used in applications for course 
recommendation by learning from historical enrollment data [32, 
33]. 


Content-based filtering approaches recommend a course to a 
student by considering the content of the course and clustering 
course and student into groups to gain similarity between them [2,3]. 
Collaborative filtering approaches recommend a course to a student 
by investigating student's similarity with the student's historical 
data in a system and predict the course that the student would be 
interested in [4-6]. Association rules based on frequent patterns are 
used to discover interesting relations that describe previous course 
selections from students [8,9]. Recently, other methods including 
sequence discovery and representation learning have been used in 
this domain [11,19,20]. However, those systems often behave like 
a “black box", i.e., recommendations are presented to the users, but 
the rationale for selecting recommendations is often not explained 
to end-users. 


2.3 Grade Prediction 


While some researchers have focused on between-course 
enrollment data, others have focused on models of predicting 
grades in future courses [13-6]. Based on what courses they 
previously took and how well they performed in them, the predicted 
grades give an estimation of how well students are prepared for 
future courses, then recommending courses to students that will 
help them to get relatively high grades [18,28,29,31]. 


However, these methods can be prone to recommending relatively 
easier courses in which students usually get high grades [17]. In 
addition, there are some students who like challenge difficult 
courses if they are interested in or think it is helpful for their future 
career, for those students, the grade prediction based 
recommendations are not enough. 
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Despite the significant success of various course recommendations, 
constraints on the number of student preferences in the university 
environment resulting in inflexibility where a_ student’s 
requirements do not align perfectly with those built into the system. 
In contrast to the aforementioned approaches, our model combines 
the concerns of performance and interest together. Also, it has the 
benefit of allowing for a custom weighting of those components, as 
well as the increased explanatory value of the model itself. 


3. PROPOSED METHOD 


We first give the definition of our recommendation problem in 
Section 3.1. Then we propose our hybrid course recommendation 
framework with three subsections introducing our Interest-based 
Score, Timing-based Score, and Grade-based Score in detail. 
Finally, those different scores are used in our course 
recommendation algorithm introduced in Section 3.3. 
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Figure 3. Overview of the proposed course recommendation. 


3.1 Problem Formulation 

Like every classic recommendation task, there are two basic 
elements user and item in our course recommendation task, where 
a user represents a student and an item represents a course. We use 
S to denote a set of students and C to denote a set of courses. Each 
s €S has enrolled some courses denoted by C, € C and eachc € 
C has its enrollment set denoted by S, & S. Let T denote a set of all 
available semesters, and t to denote a specific semester. Generally, 
there are 8 semesters for 4 academic years degree program. Let G 
denote a set of grades that student could get, and each g € G denote 
a specific grade that student obtained for a course. Let E = 
{(s,c,g,t)|s €S,c € Cy,g € G,t € T} be the set of all enrollment 
relations, which means student s enrolled in course c in semester t, 
and got the final grade g. 


Given enough students enrollment data (S,C,E), our goal is 
recommending courses to a specific student s which are not in C; 
for next semester. 


3.2 Framework 

According to the result of the survey shown in Section 1, students 
may concern different factors while they choose courses. Inspired 
by that, we propose our hybrid course recommendation framework 
that considers student interest, the timing of taking the course and 
the predicted grade of the student together. Figure 3 shows the 
overview of the proposed course recommendation. 


For each pair of student and course (s,c), we need to understand 
how suitable the course is for the specific student. We use three 
different aspects to calculate the Score(s,c) for each pair of 
student and course: 


(i) Interest based Score (IS) , which is to measure how 
interesting the course is for a_ specific student. (ii) 


Timing based Score (TS), which is to measure how suitable 
students enroll in the course at a specific time (semester) since 
different courses may have different suitable time periods. (iii) 
Grade based Score (GS) , which is to predict students’ 
performance for the course. 


We propose our approaches to estimate IS , TS and GS, 
respectively. Then, they are fused by a student-specific weight 
parameter as the Score(s,c,t). Once all of the Score(s,c,t) have 
been computed, the k courses with the highest score are selected. 


3.2.1 Interest-based Score 
Let s and c be a student and a course, respectively, the goal of 
interest score estimation is to calculate IS(s,c). 


In our framework, we extract user interest from student historical 
enrollment behaviors. Since each course of university belongs to a 
category, let CATE denote the set of all categories, cate to denote 
a specific category, then CATE = {cate,,catep,...,cate|carg|}- 
We think that there is a strong relationship between student interest 
and course categories. For instance, a student frequently enrolls in 
courses which belong to “Computer Science” may imply that the 
student has an interest in this category or he may have personal 
learning goal in this domain. Hence, it is appropriate to recommend 
the student the courses such as “Python Programming” and “Data 
science”. 


For a student s, the idea is to count the number of courses that he 
enrolled in and belongs to a category, i.e., Num(s, cate). Then, all 
of the values are normalized as the preference score from 0 to 1, 
denoted as p(s, cate), which is defined as equation (1). 


Num(s, cate) 


p(s, cate) = (1) 
MaXcaterecarE (Num(s, cate )) 


For a student s, the preference vector P,, is obtained by the 
preference score of each category, which is defined as equation (2). 


P, = (p(s, cate,), p(s, cate2), pls, cate\care|)) (2) 
We can further use P, to calculate the similarity between student s 
and other students. Let s; and s; be two students, the similarity 


between s; and s; can be measured by the cosine similarity 
measurement as shown below. 


Ps," “Ps, 
I[Ps, ll x |}Ps, 


For the convenience of computation, we use a matrix form 
representation P = P’,. ; P’,, p25 p’ ss) to denote the interest 


(3) 


sim(s; ,5;) ca 


of all students where p’, = p,/||p,|| means the normalization of 
p,. Then the similarity matrix Sim can be simply written as: 


Sim = P™ x P (4) 
where Sim; j is the result of sim(s; ,5j)- 
Based on the similarity, we could estimate the user-based interest 
score. For a student s and a course c, the Interest Score denoted as 
IS(s,c), is defined as (5), where S,; indicates the set of top-k 
similar students of s as neighbors, and J,., is an indicator function 
whose value is 1 whenc € Cs’. 
Vsresex Ic,, x sim(s ,s’) 

ses, , SIM(S , S') 


Furthermore, we try to utilize students’ major information together 
with their similarity as equation (6). 


IS(s,c) = (5) 


simMp(s,s’) = Asim(s ,s') + (1 —A)samemajor(s,s') (6) 
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Where samemajor(s,s') function equal to 1 if student s and 
student s’ have the same major, otherwise, the function equal to 0. 
A (limited from 0 to 1) is used to control the weight between 
similarity and major information. The underlying rationale is that 
each major has its owner preference on courses enrolling, students 
have the same major will generally make a similar choice in course 
selection. Also, students in the same major are more likely to be 
friends or classmates, which brings potential social link 
information into the course recommendation. Then equation (5) can 
be rewritten as below. 


Uses, Cy X SHMm(S, 5’) 


Doves,, SES, 5") 


IS(s,c) = (7) 


3.2.2 Timing and popularity based Score 

Different courses may have different suitable time periods 
(semesters). For example, in each department, courses can be taken 
by students of different grades, e.g., freshman or sophomore. 
Previous studies showed that the timing of courses has a strong 
correlation with student graduation GPA and time to degree [23]. 
Based on that, we assume that the timing of courses is also 
important for course selection. The suitable timing of courses will 
help students for good grades and successful graduation in a timely 
manner. 


For each course c, we define the Timing based Scores (TS), 
denoted as TS(c,t), where t indicates a specific semester. In our 
framework, TS is considered from two aspects: 


(1) Which semester is more suitable for taking this course? For a 
specific course, we sum up the number of enrollments for every 
semester and normalize all of the values. The result is denoted as 
T,(c,t). 

Num(c, t) 

; (8) 

Max;ter(Num(c, t )) 
where Num(c, t) represents the number of enrollments of course c 
in semester t, and T indicates the set of all time periods, i.e., 8 
semesters for 4 academic years degree program. 


T,(¢,t) = 


(2) Which courses are popular now? For a specific semester, we 
sum up the number of enrollments for every course and normalize 
all of the values. The result is denoted as T, (c, t). 


Num(c, t) 


T,(c,t) = 
pct) Maxeec(Num(c’,t)) 


(9) 


where C indicates the set of all courses. T,(c,t) and T,(c, t) are 
then fused by the harmonic mean since we want both of the two 
values are relatively high. The final Timing based Score, 
TS(c, t) can be defined as: 


T,(c,t) x T,(c,t) 
T,(c,t) + T,(c,t) 
Therefore, we can use TS(c,t) to ensure that the semester t is 


suitable for taking the course c and the course c is suitable for 
taking in the semester t. 


3.2.3 Grade-based Score 

Improperly selecting courses would seriously affect the students’ 
course achievements, which may decrease their GPA even enforce 
students to drop out. Accurately predicting students’ grades in 
future courses has attracted much attention as it can help identify 
at-risk students early [30]. 


TS(c,t) =2*x (10) 


We use the grade prediction method called cross-user-domain 
collaborative filtering proposed by Ling et al. [12]. For predicting 
the score of each coursec € C for each student s € S ,asmall set 


of senior students who have already enrolled on course c and have 
the most similar previous score distribution to student s will be 
discovered by means of Pearson correlation coefficient. The 
underlying rationale is that students with similar scores in the 
previous courses will generally obtain similar scores in the 
subsequent courses. 


Let S, denote the set of senior students who have already enrolled 
on course c. For any senior student s, € S,, the following Pearson 
correlation coefficient is used to measure the course score 
similarity between student s and the senior student s,. 


Niec,s,(9si — Isi) (9s.i = I:,i) 
= = 2 
[Bieeuas Bs — Gsi)? [Beet (2s Fi Is,i) 
(11) 


where C,,_ denotes the courses that are enrolled by both students s 

and s, , gs; and g;; denote the grade of course i by students s and 

Ss respectively. g,; and g,.; denote the average grade of courses 

enrolled by students s and s,, respectively. Accordingly, the grade 

of the course c by student s can be predicted as follows. 
_ ses, 4(Is,c) x sim(s , Ss) 


aie Ys,€5, sim(s , Ss) 


sim(s,s,) = 


(12) 


where S, ; indicates the set of top-k similar senior students of s. It 
should be noticed that students often achieve inconsistent grades in 
the various courses they take, and different students may have 
varying grades deviations, i.e. the grades deviation compared with 
the average grades among all students. Similarly, different courses 
may have varying grades deviations, i.e. the score deviation 
compared with the average score among all courses. In order to deal 
with those variations. We use the grade deviation of student s and 
the grade deviation of course c to predict student grades. 
Accordingly, equation (12) could be rewritten as below. 


Us, €5, 4 (Is,c =z Ds.c) x sim(s , Ss) 


Xs,€5,4 sim(s , Ss) 


Isc = Dse +f (13) 
where b,, = u+b, +b, denotes the baseline estimate for g,, 
with being the overall mean grade of all courses enrolled by all 
students, b, = g, — wu being the grade deviation of student s and 
be = Gc — \ being the grade deviation of course c, where g, is 
the overall mean grade of student s and g, is the overall mean 
grade of course c. 


Finally, we could use the grades that students are expected to obtain 
in future courses to boost the performance of our recommendation. 
The final Grade based Scores, GS(s,c) can be defined as 
normalized values of grades. 


Isc 
Max¢rec(Gscr) 


The total score of student and course pair score (s,c) can be 
written as: 


Score =a x IS(s,c) +B XTS(s,t) +y X GS(s,c) (15) 


Where a, £, y are parameters to control the proportion of weights 
from different sources. By taking those scores into account 
simultaneously, a course that the student interested in, and suitable 
for him to take to get a high grade could be ranked higher than other 
courses. Also, student could control the weighting of those 
components to have a better understanding of the data and decision- 
making. 


GS(s,c) = (14) 
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3.3 Course Recommendation Algorithm 

The whole framework can be written as Algorithm 1. Student set S, 
course set C, enrollment set E(s,c,g,t) are input and output is a 
list of recommendations Rs for student’s next semester that 
includes up to k recommendations per student. 


Algorithm 1: Generating a list of course recommendations for 
student 
Input : 
Student set S, course set C, enrollment set E(s,c, g,t); 
Output : 
Recommendation results for each student Rs. 


Calculate student interest p, foreach s € S by equation 1; 
Calculate student interest similarity by equation 4; 
Calculate student grade similarity by equation 11; 
Calculate student deviation of each student; 
Calculate course grade deviation of each course; 
foreachs € S do 
Calculate user interest-based score IS(s,c) by equation 7; 
Calculate timing-based score TS(c, t) by equation. 10; 


C©CmrrAA Nn WN = 


Calculate user grade-based score GS(s,c) by equation 13 
and equation 14; 


10 Calculate final score Score (s,c,t) by equation 15; 


11 Let Rs be the sorted list of C ordered by its Score (s,c,t) in 
descending order. 


12 endfor 


4. EVALUATION 


In this section, we conducted a series of experiments to evaluate the 
effectiveness of our proposed method. We first describe the dataset 
and experimental settings. Next, the evaluation methodology and 
metrics are introduced in detail. Finally, the results are shown in 
Section 4.4. 


4.1 Dataset 


This work focuses on undergraduate students in a traditional 
educational institution. We used a dataset from our university that 
spans for 5 years. The dataset consisted of per-semester course 
enrollment information of 2,366 students from 12 departments, 
with a total of 38,968 pseudonymized enrollment records from 
2014 through 2018. Each row of the course enrollment data 
contained semester and department information, an anonymous 
student ID and course information included course name, instructor 
and course category. 


4.2 Experiment Settings 


4.2.1 Data selection 

The most natural approach to evaluate the model is to split the data 
by semesters. As shown in Figure 1(b), most of the undergraduate 
students may take courses in the first two years. Therefore, for 
students who enrolled in 2015, the semester of Spring 2015 was 
used for training, the subsequent semesters of Fall 2015, Spring 
2016 and Fall 2016 are regarded as the testing semesters, each of 
which is tested separately. The results are evaluated by comparing 
the predicted courses and the ground-truth courses he/she has 
enrolled in. 


4.2.2. Comparison 

We name our methods as Hybrid Course Recommendation (HCR). 
We compare our method with two group popularity approaches [14] 
and Random recommendation (Random). The two group popularity 
approaches including the department level (Grp-Pop-1), which 
recommend the most popular courses in the major, and the 
academic level (Grp-Pop-2), which recommend the most popular 
courses on the major and the academic level of the student 


99 66, 99 Gee 


(‘“freshmen’”, “sophomores”, “juniors”, and “seniors”). 


4.3 Evaluation Metrics 
Like previous work [11,14,15,20], we used Recall@ns and 
Coverage as the evaluation metric for the performance. 


Coverage is measured based on the percentage of courses that have 
been recommended at least once to students, which describes the 
ability of a recommendation system to explore the long-tail item. 


Recall@ns is the percentage of actually enrolled courses of s in 
semester t that were contained in the recommendation list, where 
ns is the number of courses that the student took in the target 
semester. The reported metrics are averaged out across all students. 
Since our proposed course recommendation method considers both 
student interest and the grade he/she may obtain, we cannot only 
use the Recall metric, and instead, we use a variation of it. For the 
list of the courses R, that recommended to a student s, Let T;, is the 
set of courses in the test set of s, A, is the set of courses which 
student is expected to get the grade equal to or higher than his/her 
average previous grade. We use the ratio of |[R, NT,;N A,|and 
|T, A. As| to measures the fraction of the actual well performed 
courses that are retrieved. 


4.4 Results 


4.4.1 Interest-based Score 

In collaborative filtering strategy, taking how many similar 
students as neighbors is an important problem which is sensitive to 
the quality of the result. We investigate the performance of our 
interest model with different neighbor numbers. As shown in 
Figure 4, the performance of the model increases with the increase 
of neighbor number at first then decreases. According to the 
observation above, we pick a practical value 40 as the value of the 
neighbor number parameter in our follow-up experiments. 


We also investigate the performance of our interest model with 
different weights between similarity and major information. As 
shown in Figure 5, we can observe that the performance of the 
model increases with the decrease of A in terms of Recall. The 
reason is that the model considers not only the similarity but also 
the major information. That is, each major has its own preference 
for courses enrolling, major information will improve the 
performance of the algorithm. 


However, 100% recall could be bad because the system just 
recommends what students do anyway. We noticed that the 
Coverage also decreases with the decrease of A. The model seems 
benefit from the major information while scarifying the diversity of 
results. Recommendations for courses at other departments 
sometimes are useful to mine more long-tail student interest while 
students usually ignored that these courses existed or that their 
content matched their interests. To achieve the best performance 
of recommendation, we need to make a trade-off. According to the 
observation above, we set A as 0.2 in our follow-up experiments. 
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Figure 4. Performance of different neighbor numbers. 


Effect of Major Information 
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Figure 5. Evaluation of major information. 


4.4.2 Influence of different factors 

To illustrate the influence of different factors, we set each 
parameter a, f8,y from 0 to 1 with a step size 0.05 to find the 
optimal combination. 


Table 2. Performance of different parameters 


(a, B.Y)=| (a By) | (BY) x 

Model /*(1,0,0) | =(0,1,0) | =(,0,1) | 8) 

Recall@ns 0.380 0.412 0.341 0.434 
Recall(a)@ns 0.311 0.315 0.332 0.322 
Coverage 0.534 0.212 0.356 0.516 


As shown in Table 2, the interest score and timing score has a good 
explanatory value for the recommendation than others. Also, the 
suitable timing of taking a course will help students to get a good 
grade. A recommendation only based on the grade has a good 
performance for recommending high grade courses. However, the 
results cannot help all the students. We reached the best Recall@ns 
with (a =0.4, 8 =0.45 and y =0.15)*. 


The results indicate that recommendations that are aimed only at 
one factor are likely not to be satisfied by every student. As we 
discussed before, different students may have completely different 
orientations based on their own reasons, which serves as different 
criteria such as their preferences, interests, needs, performance, 
etc. Such a hybrid system could provide explanations and user 


controls for different categories of target students to support the 
interpretation of the data and decision-making. 


Table 3. Evaluation of course recommendation 


Semester | Model | Recall@ns |Recall(a)@ns| Coverage 
Fall 2015 | Random 0.048 0.036 - 
Fall 2015 |Grp-Pop-1} 0.374 0.306 0.272 
Fall 2015 |Grp-Pop-2} 0.452 0.342 0.342 
Fall 2015 HCR 0.472 0.393 0.578 
Spring 2016] Random 0.025 0.020 7 
Spring 2016/Grp-Pop-1) 0.325 0.201 0.305 
Spring 2016|]Grp-Pop-2| 0.423 0.372 0.237 
Spring 2016] HCR 0.431 0.402 0.342 
Fall 2016 | Random 0.002 0.002 = 
Fall 2016 |Grp-Pop-1] 0.326 0.243 0.213 
Fall 2016 |Grp-Pop-2} 0.441 0.387 0.250 
Fall 2016 | HCR 0.463 0.392 0.559 


4.4.3. Comparison result 

We analyze the performance of different algorithms. The results in 
Table 3 show that our framework performs well when compared 
with other methods. 


As the results show, both of the Recall and Recall(a) of Random 
recommendation strategies are very low since there are a large 
number of courses, but each student only averagely chooses a few 
courses per semester. Hence, it is difficult to recommend the right 
course. Popularity approaches are having considerably satisfactory 
performance in Recall since popular courses which are taken by 
students frequently usually attract most of students. However, Grp- 
Pop-1 and Grp-Pop-2 do not consider student preference, it is also 
difficult to mine more long-tail student interest as the Coverage is 
low. In addition, Grp-Pop-1 and Grp-Pop-2 are not good in Recall(a) 
since they only consider the popular courses, ignore the 
performance the student is expected to get in the recommended 
courses. 


5. CONCLUSION 


This research aims to recommend suitable courses for learners and 
study how to design a personalized course recommendation in the 
university environments. In this paper, we propose a hybrid course 
recommendation framework that considers student interest, the 
timing and popularity of courses, and predicted performance of 
students, simultaneously. Experiments are conducted to confirm 
the effectiveness of the proposed approach. The results show that 
the proposed hybrid course recommendation approach performed 
well compared to other methods. Also, the model itself is flexible 
in the sense that one can easily adjust or extend it by changing the 
recommendation formula and incorporate more information. 
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